k-connectivity for confined random networks 
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fc-connectivity is an important measure of network robustness and resilience to random faults and 
disruptions. We undertake both local and global approaches to fc-connectivity and calculate closed 
form analytic formulas for the probability that a confined random network remains fully connected 
after the removal of k — 1 nodes. Our analysis reveals that fc-connectivity is governed by local 
and not global features of the network domain, hence, our results can aid in the design of reliable 
networks, an important problem in e.g. wireless ad hoc and sensor networks. 



I. INTRODUCTION 

Random geometric networks [1] consist of a collection 
of nodes randomly scattered in a region of space, pairwise 
connected with a relative position dependent probabil- 
ity. In recent years, they have been particularly success- 
ful in modelling and understanding numerous complex 
systems related to inter alia, nanoscience [2], epidemiol- 
ogy [3], forest fires [4], social, financial, and corporate 
networks [5-7], power-grids [8], and wireless communi- 
cations [9-11]. In order to characterize different proper- 
ties of complex network, a large number of methods and 
measures have been developed [12]. These include for 
example various clustering statistics, network modular- 
ity measures, node importance and correlations between 
degrees of neighbouring nodes. 

Communication networks, provide rapid transfer of in- 
formation across huge distances, with applications rang- 
ing from the Internet, tele-medicine, intelligent trans- 
port, tracking of endangered species, hazard detection 
systems, security monitoring, etcetera. Consequently, re- 
silience to random faults or attacks are of paramount 
importance for the smooth functionality of the system. 
A popular measure of network robustness (particularly 
for communication networks) is fc-connectivity, that is, if 
any k — 1 nodes are randomly chosen and removed the re- 
maining network remains fully connected. Equivalently, 
a network is said to be fc-conncctcd if for each pair of 
nodes there exist at least fc mutually independent paths 
connecting them [1]. Fig. 1 shows examples of k = 1,2, 
and 3 connected networks. In general, the random re- 
moval of nodes may result from a technical failure (e.g. a 
software/hardware malfunction) or a random attack that 
may disrupt network functionality and lead to cascades 
of catastrophic failures [13]. 

Historically, the classical problem of fc-connectivity has 
been addressed in the asymptotic: limit of infinite network 
size and deterministic link formation whenever nodes are 
within a certain range [14, 15]. Here, we consider spa- 
tially confined networks formed by probabilistic link con- 
nections. Many real networks, although large, are not 
of infinite size and are often confined within a finite re- 
gion. Furthermore, noise, uncertainty, or the variation in 
connectivity range of individual nodes justifies the use 
of probabilistic link formations rather than determin- 
istic ones. In fact, probabilistic link-models are much 



FIG. 1: Examples of fully connected networks with N = 13 
nodes in a convex domain satisfying k — 1,2, and 3 connec- 
tivity from left to right. 



preferred in applications like wireless communications as 
they can also adequately account for small-scale scatter- 
ing effects. 

In this paper, we will derive closed form analytic for- 
mulas for the probability of a random network residing 
in arbitrary 2 and 3 dimensional convex domains to be 
fc-connected. We achieve this by undertaking both lo- 
cal and global approaches to fc-connectivity. The former 
refers to the local perspective of a single node, while the 
latter to the global perspective of large clusters of nodes. 
Contrary to the expected universal features of large net- 
works, our analysis reveals that k-connectivity is gov- 
erned by local and not global features of the network 
domain. 



II. DESCRIPTION OF THE PROBLEM 

The networks we wish to model consists of N randomly 
distributed nodes with locations r^ £ V a convex subset 
of R d , with i = 1, 2, ... N, according to a uniform den- 
sity p = N/V, where V = |V| and | • | denotes the size 
of the set using the Lebesgue measure of the appropri- 
ate dimension or the cardinality of a finite set. We con- 
sider convex geometries as we only allow for line-of-sight 
links between nodes. After deployment of the nodes in 
V, communication links between pairs of nodes arc es- 
tablished with probability H(rij), often written as H™ 
where r,j = \rj — r.j | is the distance between nodes i and 
j. Hence, the relevant network g = (5, L) is formed, con- 
sisting of the set of nodes S = {1, 2, 3, ... , N} paired by 
the collection L C {(i,j) € S 2 : i < j} of direct links. 

We maintain physical relevance by adopting a specific 
pair connectedness function Hij derived from wireless 
communication theory [17] and applicable to ad hoc and 



sensor networks. In particular, we use a Rayleigh fading 
model suitable when there is no dominant communication 
channel along a single line of sight between transmitter 
and receiver but rather a sum of many paths with phases 
evenly distributed between and 2ir. The resulting con- 
nection probability between two nodes a distance r apart 
is given by 
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where /3 sets the characteristic connection length tq and 
the parameter n is called the path loss exponent and is 
typically set to r\ = 2 corresponding to propagation in 
free space but is observed to be n > 2 for cluttered envi- 
ronments e.g. heavily built-up urban environments. Un- 
less otherwise stated, we will use 77 = 2. In doing so 
H(r) is a Gaussian function thus rendering the math- 
ematics tractable. It is worth noting that in the limit 
of i] — » 00, the connection between nodes is no longer 
probabilistic and converges to the well studied case in 
geometric graph theory [16], the unit disk model with an 
on/ off connection range at vq. 

The connectivity of the network can be measured by 
checking whether any node i can communicate in a multi- 
hop fashion with any other node j 7^ i. If this is the 
case then the network is said to be fully connected, 
or 1-connected. In computer simulations, one initiates 
a search algorithm to count the number of connected 
components (clusters). If only a single cluster is found 
then 1-connectivity is established. Typically such algo- 
rithms have computation complexity of O(NlnN). For 
2-connectivity, a random node is removed from the net- 
work and the search algorithm is run. If successful, the 
node is replaced and its original links reconnected and a 
different node is removed and the algorithm is repeated. 
If successful for all N nodes then 2— connectivity is es- 
tablished. Therefore, the computational complexity is 
now of O (N(N - 1) ln(N - 1)). For fc-connectivity this 
number grows like ~ N k (\nN). 

For a given node density p we are interested in the 
probability Pf c (k) of a randomly formed network to be 
fc-connected. Of course this depends on the connectivity 
function Hij and the domain shape. In order to pro- 
duce the 5-curve describing Pf c (k) as a function of the 
density p, one needs to perform a Monte Carlo computer 
simulation, averaging over many realizations of the above 
described algorithm. In this paper we will provide closed 
form analytical formulas which accurately predict this 
function in 2 and 3 dimensional convex domains hence 
eliminating the need for such heavy computer simula- 
tions. 



III. LOCAL APPROACH 

We adopt a bottom-up approach and begin our inves- 
tigation from the local view point of a single node. The 
probability of some node i connecting with some other 
node j is given by 

1 
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FIG. 2: a) Log-log plot of p2 vs V for square and circular 
disk domains of equal volumes illustrated with square/disk 
markers using j3 = 1. The dashed line is the asymptotic 
distribution n/(/3V). b) 3D plots of Hi(ri) for square and 
circular disk domains both of equal volumes V = 100. 



Note that Hi can be studied in detail for different pair- 
connectedness functions H(r) (e.g. anisotropic), and 
power and diversity scaling laws can be deduced through 
in-depth analysis [18, 19]. In the discussion that follows 
however, we will mainly be concerned with global net- 
work observables and concentrate on how local properties 
of nodes contribute to them. 

Since, nodes are deployed independently with a uni- 
form density, the probability that node i connects with 
exactly fc other nodes (i.e. node i is of degree k) denoted 
here by di(k), is given by the binomial distribution 
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which for H(r) a hard step function (77 = 00) would cor- 
respond to the probability of finding k nodes (excluding 
node i) in the ro-neighbourhood of i - ,;, and (N — 1 — k) 
nodes elsewhere in the available domain V. Another way 
of expressing (3) is achieved by noting that if N is large 
and Hi is small, di(k) is well approximated by the Pois- 
son distribution 
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where A^ = (N — l)Hi, and Di(k) is the corresponding 
cumulative distribution function. The Poisson approxi- 
mation is justified here as 7 > 1 thus making Hi small 
and N large. 

Integrating (2) over r^ gives P2, the probability that 
two randomly selected nodes connect to form a pair 



p 2 = yz j a jy^Od^dr^. 
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Numerical integration of (5) for a square and a circular 
disk domain of equal volumes reveals that p 2 is very much 
insensitive to the domain's shape and decays like ~ ir/f3V 
as V — >• 00 (see Fig. 2a)). This is no surprise as for both 
domains, while distinctly different, Hi remains constant 
at it j 'j3V for the majority of node positions (see Fig. 2 
b)). ' 

The average number of nodes connected to a node in a 
network is called the mean degree and is a popular mea- 
sure of robustness and reliability. From the degree dis- 
tribution (4) we can immediately deduce that the mean 



degree (as well as the variance) is just A = J Ajdr,-/V = 
(N — l)p2 and is also highly insensitive to the domain 
shape. Moreover, for N and V large, we have that 
A- (TV - 1)tt/(/3V) swp7r//3. 

Finally, we turn to investigate short range correlations 
and look at the 2-point correlation function in an infinite 
domain. Here, we keep n general and consider 3 nodes 
with polar coordinates (n, #1), (0, 0) and (r, 0) and define 
the two point correlation function as 



C(r, V ) = 
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Equation (6) is nothing more than Bayes' theorem mea- 
suring how likely it is that node 1 connects with node 3, 
given that node 1 is connected with node 2. The denomi- 
nator of (6) amounts to 2-kY (2 j r\) j r\(i 2 1 ^ . Expanding the 
integrand of the numerator in powers of r <C 1 we obtain 
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Note that for H(r) a deterministic step function (i.e. 
r\ = 00) equation (7) diverges and instead is given by the 
circle-circle intersection C(r, 00) = 1 — 2r/(xro) + 0(r 3 ). 
This short calculation suggests that two nearby nodes are 
strongly correlated for deterministic (hard) connectivity 
functions with C(0, 00) = 1. This is not the case however 
for probabilistic (soft) connectivity functions. Indeed for 
the softest case of r\ = 2 we have C(0, 2) = 1/2. It is rea- 
sonable to expect that this distinction of hard versus soft 
H(r) persists in higher dimensions and for correlations 
between n>2 nodes. 



IV. MINIMUM NETWORK DEGREE 

For a given configuration of node positions we define 
Pmd(?i, ■ ■ ■ rjv, k) as the probability of the corresponding 
network to have a minimum degree of at least k, i.e. 
every node is connected to at least k other nodes. The 
average of this quantity over all possible configurations 
P m d{k) = (P md (r 1 ,...r N ,k)) is the overall probability 
of a network with minimum degree k. We define the 
spatial average of an observable O over all possible node 
configurations as 

(°) = yw 0(ri,r2,...,rjv)dr 1 dr 2 ...dri\r. (8) 

Assuming that for N 3> 1, the degree of node i is almost 
independent of the degree of node j ^ i, we write [20] 
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Pmd(k) = (n^(degree( ri ) > k)) = (Q (1 - D. t (k - 1))) 
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The seemingly strong assumption of independence made 
here is justified when performing the spatial average (8) 



as two nodes i and j are sufficiently apart and thus uncor- 
rected for most node positions in V. Furthermore, for 
soft connectivity functions H(r), we expect only weak 
correlations between n-tuples of nodes, as argued in the 
previous section (see Eq. (7)). 

Substituting in (9) the definition of Di(k), performing 
the average and omitting terms of 0(1/N) we arrive at 
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where Mff(rj) = VHi. There are several important ob- 
servations to be highlighted here. Firstly, in the high 
density limit we expect the integrals appearing in equa- 
tion (10) to be dominated by contributions where M# (r;) 
is small. This is due to the exponential part of the inte- 
grand dominating over the power. Secondly, we expect 
that Mi{{vi) is small near the domain boundaries i.e. 
near the corners and edges (see Fig. 2 b)) which physi- 
cally correspond to the most hard to connect to regions 
of V. We conclude by noting that unlike A, P m d(k) is 
strongly influenced by the details of the domain bound- 
ary. 



V. GENERAL ANALYTIC FORMULAS 

The integrals to be approximated in (10) are of the 
form 



J m 
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where My(r 2 ) = JyH^drx. Due to the short-range in- 
teractions between nodes, hard to connect to regions be- 
come almost independent of each-other and so we approx- 
imate the integral of (11) by a sum of independent con- 
tributions due to different boundary objects [21]. That 
is, for an arbitrary convex domain Vcl 2 the integral 
of (11) can be approximated by the sum of the bulk con- 
tribution (B), and a number of edge (E) and corner (C) 
contributions 
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For instance, the "house" domain shown in Fig. 3, as far 
as I m is concerned, decomposes into a homogeneous bulk 
contribution, 5 edges and finally 5 corners. The only 
restriction in this approximation is that the individual 
boundary elements are sufficiently apart (approximately 
a distance greater than 2ro). 

We now consider each of these contributions separately, 
first for d — 2 and later for d = 3. Due to length restric- 
tions, we will omit any tedious calculations and give only 
important steps and final results. 

The bulk contribution to (12) can be obtained by ig- 
noring any boundary effects and thus considering a ho- 
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FIG. 3: Decomposition of a house domain into contributions 
from the bulk, the 5 edges and 5 corners. 



The edge contribution to (12) can be obtained by ig- 
noring any curvature effects and thus considering the pos- 
itive half plane V = [0, oo) x (— oo, oo) entailing 
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The corner contribution to (12) can be obtained by 
considering a wedge domain V = {(r, 9) : 9 € (0,1?)} in 
polar coordinates for general angle "d < ix. Expanding 
H(t 12 ) to linear order in r 2 sa 0, i.e. near the corner we 
get 
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Note that (15) can be expressed as a single term but does 
not provide further insight and so is left as such. 

We now repeat the above calculations for V C M 3 . 
Therefore, I m now decomposes into a homogeneous bulk 
contribution (B), a surface area contribution (S), and 
edge (E) and corner (C) contributions 
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For the surface area contribution, we expand H\ 2 to 
linear order in r 2 m R, where R = y / 5'/(47r) and S is 
the total surface area of the domain. In doing so we are 
effectively saying that the surface area contribution to 
(11) for an arbitrary right prism is equal to that of a 
sphere of equal surface area. Details such as corners and 

edges are ignored at this stage as they will be considered 

(s) 
separately at a later stage. We find that M H 't- 

%R-ri), 



203/ 2 



j(S) 



P 



Tlf) 



1+m 



-r 



1 



pn 



3/2 



2^ 3 / 2 



(18) 



Where two planes meet at an angle $ € (0, it), an edge 
of length L is formed. To calculate the contribution to 
(11) due to the edge we express Hi 2 in cylindrical coordi- 
nates such that the edge is centred along the z-axis. We 
expand H\ 2 about r 2 « z 2 w 0, keeping only linear terms 
and calculate 
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Finally, corners in right prisms are formed where 3 
edges come together, 2 of which are at an angle 1? and 
the 3rd is perpendicular to both. We center the corner 
at the origin with the perpendicular edge running along 
the positive z-axis. We expand Hi 2 about r 2 s=a z 2 « 
keeping only linear terms and calculate 
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In hindsight of the above calculations, it easy to see 
that 2D and 3D contributions to (11) are similar in struc- 
ture, hence hinting towards possible generalization to 
arbitrary dimensions d > 0. Furthermore, we observe 
that the dominant contribution at hight densities p comes 
from I m ' . Physically, this makes sense as P m d(k) is most 
probable to fail near the hardest to connect to region of 
V, i.e. the sharpest corner, in our case characterised by 
its angle $. 



We will restrict the discussion to domains belonging to 
the set of right prisms. As there is only one variable angle 
to consider we can benefit from the use of cylindrical 
coordinates when calculating I m and I m . 

Using spherical coordinates, we expand H\ 2 for r 2 « 

and find that for homogeneous domains M H = §372 , and 
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VI. GLOBAL APPROACH 

For a given configuration of node positions we define 
Pf c (ri, . . .rjv, k) as the probability of the correspond- 
ing network to be fc-connected. The average of this 
quantity over all possible node configurations Pf c (k) = 
(Pf c (ri, . . . Tff, k)) is the overall probability of obtaining 
a fc-connected network. It is clear that a fc-connected 
network has minimum degree k. The opposite is not true 
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FIG. 4: Top: Computer simulation of P m d(k) (filled mark- 
ers) and Pf c (k) (hollow markers) for k G [1,4] using j3 — 1 
in a house domain of sides L = 10 and L/y/2. The thick red 
curves are the analytic predictions due to (10) and approxi- 
mations (12), (13), (14), and (15). Bottom: 1 - P m d{k) and 
1 — Pfcik) on a log-linear scale. 



however and hence the former set is a subset of the latter 
and so Pf c (k) < P m d(k). For instance a network consist- 
ing of two pairs of connected nodes has minimum degree 
1 but is not 1-connected. Nevertheless, the two concepts 
are strongly correlated, particularly in the high density 
limit where the two converge [16, 20]. Indeed, in Fig. 4, 
Pf c {k) (shown in hollow markers) follows P m d(k) very 
closely from below. Understanding the subtle differences 
between Pf c (k) and P m d(k) has posed a difficult chal- 
lenge to the graph theoretic community since the early 
80's and has ever since been approached from a variety 
of different directions. Here, through simple argumenta- 
tion and the use of a cluster expansion for Pf c (l) deriving 
from statistical physics [18], we will show that Pf c {k) and 
Pmd(k) have the same asymptotic distribution. 

The probability that nodes connect (or not) leads to 
the trivial identity 1 = Hij + (1 — Hij). Multiplying over 
all possible links with nodes in S, expresses the proba- 
bility of all possible combinations. This can be written 



*= n 

(l,j)eS 2 ;i<j 



H lJ + il-H lj )\= J2 



U 



</' 



u 9 = n h h n o- - h v)- 



(21) 



(22) 



Recall that g — (S, L) is a network consisting of the set 
of nodes S = {1, 2, 3, ... , N} paired by the collection 



L Q {(i,j) G S 2 : i < j} of direct links. As a slight abuse 
of notation we have used (i,j) € g to denote that (i,j) is 
an element of the set of links L associated with g. 

The sum in equation (21) contains 2 NtyN ~ 1 ^ 2 separate 
terms and can be expressed as collections of terms deter- 
mined by their largest cluster: 
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where G s is the set of graphs with nodes in S, and Gj 
the set of graphs with nodes in 5 and largest connected 
component (cluster) of size j with 1 < j < N. We iden- 
tify the first term on the RHS of (23) as the probability 
of the associated network being 1-connected. Hence, re- 
arranging equation (23) and averaging over all possible 
node configurations we get 
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Equation (24) clearly confirms the physical picture that 
at high densities, full connectivity is simply the com- 
plement of the probability of an isolated node i.e. a 
node of degree 0. Moreover, second order corrections 
are due to scenarios involving a single cluster of size 
N — 2. At high densities it was shown that (24) is given 
by P /C (l) = l-pf v e-pM^dn [18]. 

The probability of a network to be 2-connected can 
be expressed as P fc (2) = P/ C (l) - X(l) where X(l) > 
is the probability of obtaining a fully connected network 
which is not 2-connected. This follows from the fact that 
fc-connectivity implies (k — l)-connectivity At high den- 
sities, a fully connected network which is not 2-connected 
will typically contain a single node which is of degree 1 
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for N ^> 1. Repeating the same argument k times we get 
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where X(m) is the probability of obtaining an re- 
connected network which is not (m + l)-connected. Ex- 
amples of networks where a single node prohibits (k + 1)- 
conncctivity can be seen in Fig. 1 for k = 1, 2, 3. Compar- 
ing equation (26) to (9), we can conclude that Pf c (k) and 
Pmd(k) have the same asymptotic distribution. More- 
over, it is worth noting that fc-link-connectivity, where a 
network remains connected whenever fewer than k links 
are removed, is sandwiched by P m d{k) and Pf c (k). 



VII. NUMERICAL VERIFICATION 

We numerically test our results in the house domain 
seen in Fig. 3 with sides L — 10 and Lj\f2 (using /3 = 1). 
The top panel of Fig. 4 shows in full markers the nu- 
merically obtained P m d(k) for k £ [1,4], while the S- 
shaped red curves are the analytic predictions of (10) 
using (12), (13), (14), and (15). An excellent agree- 
ment is observed, especially for high densities as can be 
seen in the lower panel of Fig. 4 depicting 1 — P m d{k) 
on a log-linear scale. The hollow markers correspond to 
Pfc(k) and closely follow P m d(k) from below. At high 
densities, the difference becomes increasingly difficult to 
make confirming that the two observables have the same 
asymptotic distribution. Numerical simulations for the 
3D case are omitted due to the heavy computational ef- 
fort required. In fact, the purpose of deriving all these 
formulas was primarily to avoid the need of such heavy 
simulations. 



VIII. CONCLUSIONS 

We have investigated the probability of forming a k- 
connccted random network Pf c (k) confined within con- 
vex 2 and 3 dimensional domains and have found that 
for probabilistic link models, Pf c {k) is governed by local 
contributions due to individual boundary effects which 
can be singled-out and analysed independently. As a re- 
sult, we have obtained accurate approximations for con- 
tributions to Pf c due to the bulk, surface area, edges and 
corners of the domain. These contributions can now be 
easily calculated for an arbitrarily complex (but convex) 
domain, and summed up to give accurate predictions to 
Pf c (k). We have confirmed the validity of our results 



through computer simulations in a 2D house domain. 

The results presented in can have direct and applica- 
ble benefits in the design of wireless multi-hop relay net- 
works where communication devices (nodes) pass mes- 
sages to each other without the need of a central router. 
Here, network resilience to random faults or attacks are 
of paramount importance for the smooth functionality 
of the system. Therefore, quantitative knowledge of the 
effects the spatial confinement has on the network's k- 
conncctivity can result in valuable design recommenda- 
tions in technologically important areas e.g. for wireless 
ad hoc and sensor networks. Similarly, one may also 
consider ways to mitigate these effects by for example 
increasing the signal power or number of communication 
channels. 

Finally, while our work relies heavily on approxima- 
tions and asymptotic expansions, it is mathematically 
tractable, intuitive and above all valid and can there- 
fore provide further insight on the difficult problem of 
resilience reliability and control [22] of large and highly 
interconnected real networks such as systems of water, 
food and fuel supply, financial transactions and power 
transmission [13], or smaller, boundary dominated ones 
involving for instance the electrical conductivity of car- 
bon nano-tubes [2]. 
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